Contribution to Relational Classification with Homophily Assumption

نویسنده

  • Peter Vojtek
چکیده

Relational classification is a set of methods employing relations between instances in a dataset as well as their attributes. Homophily is a phenomenon present in graphs which capture real-world data, e.g., social connections between humans. Homophily is defined as following: related (neighbouring) vertices are more likely to share similarities (e.g., the same class, attribute value) as non-related instances. Contemporary relational classifiers implicitly require homophily to be present in a graph (so called homophily assumption), however these methods are unable to determine the homophily of each node and take benefit of this information. Our work is at first dedicated to classification of relational classifiers. Next, impact of homophily assumption on particular branches of relational classifiers is analyses and then homophily measures are defined. According to this analysis, two new relational classifiers are designed. First method belongs to simple relational methods and employs local graph ranking in order to redefine neighbourhood function, second method is collective inference based and involves information exchange moderation. Both methods are capable to increase the quality of class assignment in networked data due to their capability to employ and measure homophily in a graph.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Robust Collective Classification Approach to Trust Evaluation

In this paper, we present a collective classification approach for identifying untrustworthy individuals in multi-agent communities from a combination of observable features and network connections. Under the assumption that data are organized as independent and identically distributed (i.i.d.) samples, traditional classification is typically performed on each object independently, without cons...

متن کامل

Homophily of Neighborhood in Graph Relational Classi er

Quality of collective inference relational graph classi er depends on a degree of homophily in a classi ed graph. If we increase homophily in the graph, the classi er would assign class-membership to the instances with reduced error rate. We propose to substitute traditionally used graph neighborhood method (based on direct neighborhood of vertex) with local graph ranking algorithm (activation ...

متن کامل

Bias and variance in the social structure of gender

The observation that individuals tend to be friends with people who are similar to themselves, commonly known as homophily, is a prominent and well-studied feature of social networks. Many machine learning methods exploit homophily to predict attributes of individuals based on the attributes of their friends. Meanwhile, recent work has shown that gender homophily can be weak or nonexistent in p...

متن کامل

Approaches to Semantics in Knowledge Management

There are different approaches to modeling a computational system, each providing different semantics. We present a comparison among different approaches to semantics and we aim at identifying which peculiarities are needed to provide a system with uniquely interpretable semantics. We discuss different approaches, namely, Description Logics, Artificial Neural Networks, and relational database m...

متن کامل

Social Distance in the United States: Sex, Race, Religion, Age, and Education Homophily among Confidants, 1985 to 2004

Homophily, the tendency for similar actors to be connected at a higher rate than dissimilar actors, is a pervasive social fact. In this article, we examine changes over a 20-year period in two types of homophily—the actual level of contact between people in different social categories and the level of contact relative to chance. We use data from the 1985 and 2004 General Social Surveys to ask w...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010